Hierarchical Taxonomy Extraction by Mining Topical Query Sessions
نویسندگان
چکیده
Search engine logs store detailed information on Web users interactions. Thus, as more and more people use search engines on a daily basis, important trails of users common knowledge are being recorded in those files. Previous research has shown that it is possible to extract concept taxonomies from full text documents, while other scholars have proposed methods to obtain similar queries from query logs. We propose a mixture of both lines of research, that is, mining query logs not to find related queries nor query hierarchies but actual term taxonomies. In this first approach we have researched the feasibility of finding hyponymy relations between terms or noun-phrases by exploiting specialization search patterns in topical sessions, obtaining encouraging preliminary results.
منابع مشابه
Mining Diagnostic Taxonomy for Multi-Stage Medical Diagnosis
Experts’ reasoning selects the final diagnosis from many candidates by using hierarchical differential diagnosis. In other words, candidates gives a sophisticated hiearchical taxonomy, usually described as a tree. In this paper, the characteristics of experts’ rules are closely examined from the viewpoint of hierarchical decision steps and and a new approach to rule mining with extraction of di...
متن کاملDetecting User Sessions in the Tumba! Query Log
This paper describes an approach to detect distinct user sessions from the logs of a particular search engine. We present our work by describing the proposed algorithm and some interesting usage patterns that were detected. Some pitfalls of our approach are also noted. Finally, we give some insights on how web log mining could be exploited in other areas such as semantic relation extraction or ...
متن کاملMining Large Query Induced Graphs towards a Hierarchical Query Folksonomy
The human interaction through the web generates both implicit and explicit knowledge. An example of an implicit contribution is searching, as people contribute with their knowledge by clicking on retrieved documents. Thus, an important and interesting challenge is to extract semantic relations among queries and their terms from query logs. In this paper we present and discuss results on mining ...
متن کاملIdentifying Structures in Social Conversations in NSCLC Patients through the Semi-Automatic extraction of Topical Taxonomies
The exploration of social conversations for addressing patient’s needs is an important analytical task in which many scholarly publications are contributing to fill the knowledge gap in this area. The main difficulty remains the inability to turn such contributions into pragmatic processes the pharmaceutical industry can leverage in order to generate insight from social media data, which can be...
متن کاملAutomatic Extraction of Structurally Coherent Mini-Taxonomies
In this paper we demonstrate an automatic approach for emergent semantics modeling of ontologies. We follow the collaborative ontology construction method without the direct interaction of domain users, engineers or developers. A very important characteristic of an ontology is its hierarchical structure of concepts. Semantic web is heavily dependent on the XML paradigm, which inherently follows...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009